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DETAILED ACTION 

1. Claims 1-21 have been examined. 

Papers Received 

2. Receipt is acknowledged of amendment papers submitted, where the papers 
have been placed of record in the file. 

3. The amendment has successfully overcome the claim objection as well as the 35 
USC 112 rejections both of which are herein withdrawn. 

4. Though the original objection to the specification has been withdrawn and is 
withdrawn, a new objection to the specification remains as stated below. 

Specification 

5. The disclosure is objected to because of the following informalities: the 
amendment to the specification is not of proper format. The word "April" in paragraphs 
[0002] and [0003] has been added but not underlined as called for by the revised 
amendment practice of 37 CFR 1.121. 

Appropriate correction is required. 

Claim Rejections - 35 USC § 102 

6. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - t 

(a) the invention was known or used by others in this country, or patented or described in a printed 
publication in this or a foreign country, before the invention thereof by the applicant for a patent. 

7. Claims 1-7, 15-17 and 19-21 are rejected under 35 U.S.C. 102(a) as being 
anticipated by Rotenberg (AR-SMT). 
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8. In regard to claim 1, Rotenberg discloses a computer system, comprising: 

.a. a pipelined, simultaneous and redundantly threaded ("SRT") processor 
comprising at least one data cache and a slack counter; [In the second column of 
section 1 .1 , the system is shown to run two explicit copies of the program run 
concurrently on the same processor resources. Further down it is shown that 
this is possible using simultaneous multithreading. Thus two copies of the same 
program (the term redundant is used on the top of the next page) are run in these 
simultaneous threads. Figure 5 shows that the system is broken up into pipeline 
stages and is thus pipelined. This figure, as well as figure 4, shows that the 
system includes a data cache or d-cache. The delay buffer of figure 2 functions 
as the slack counter as described more fully below.] 

b. a main system memory coupled to said processor; [As shown above there 
is a system memory coupled to the processor through the disambiguation unit.] 

c. and wherein said SRT processor processes a set of instructions in a 
leading thread and also in a redundant trailing thread to detect transient faults in 
the computer system; [Section 1 .2 shows that the disclosed system comprises 
multiple threads (streams) that run the same program at the same time with one 
thread having a slight lag and thus a set of instructions is run in a leading and 
trailing thread.] 

d. and wherein when a data load command appears in the leading thread, 
the processor loads the requested data and replicates the value for the 
corresponding data load command in the trailing thread; [The paragraph 
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immediately under figure 3 shows that a delay buffer contains data and control 
information from the first run of the program (on the leading thread or A-stream). 
This is a clear differentiation of the result data and other control information such 
as that for predictions. In the paragraph directly above figure 3, it is shown that 
source operands (data) of instructions are predicted. In the paragraph directly 
below the figure again, it is stated that the delay buffer provides the R-stream 
(trailing thread) with perfect data flow prediction. Therefore, perfect data operand 
prediction must be provided, and thus the data operands themselves provided 
(replicated as the control information) in this delay buffer for the trailing thread.] 
e. and wherein the slack counter is used to cause instructions in the trailing 
thread to lag behind corresponding instructions in the leading thread. [The slack 
counter or delay buffer of figure 2 is described in section 2.1 .4. Here it is shown 
that the trailing thread (R-stream) lags behind the leading thread (A-stream), 
which is running ahead. The section also shows that this delay or lag is 
controlled by the length of the delay buffer and thus the delay buffer is 
functionally equivalent to the slack counter and it keeps track or counts the 
number of instructions or cycles the threads are separated by as instructions 
from the leading thread pass through it.] 
9. In regard to claim 2, Rotenberg discloses the computer system of claim 1 further 
comprising a load value queue; wherein when the processor loads the requested data, 
the processor stores the same data in the load value queue. [As shown above, the 
loaded operand data is stored in a delay buffer. The first paragraph of section 1 .2 
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shows that this buffer is in fact a queue and thus another appropriate name for the 
structure is a load value queue.] 

1 0. In regard to claim 3, Rotenberg discloses the computer system of claim 2 
wherein the processor does not store the data value in the load value queue until the 
data load command in the leading thread commits. [Figure 2 and the first paragraph of 
section 1.2 shows that the data stored in the load value queue (delay buffer) is pushed 
there when the results of the A-stream (leading thread) are committed.] 

11. In regard to claim 4, Rotenberg discloses the computer system of claim 2 
wherein the load value queue is a FIFO buffer and wherein all data load commands in 
the trailing thread are executed by the processor in their original, program order. [The 
first paragraph of section 1 .2 show that the load value queue is a FIFO delay buffer. 
Though section 1.3.1 details that instructions in the trailing thread (R-stream) may be 
executed highly in parallel due to perfect prediction, the instructions are not executed 
out-of-order, as can be seen in figure 3. Therefore the trailing thread executes 
instructions including the data load commands in program order.] 

12. In regard to claim 5, Rotenberg discloses the computer system of claim 2 
wherein the data load command is a request for data in the data cache. [As shown 
above, a data cache exists in the system. The load commands have been shown 
above to be requests for data operands. It is inherent that a data cache first requests 
data from a data cache because its whole purpose of existence is to be a fast location 
for data retrieval. Further proof is given in the included IEEE dictionary term for "hit" 
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(shown to be a synonym for cache hit). Here it shows that referencing data in cache 
eliminates the need to go to a secondary storage.] 

13. In regard to claim 6, Rotenberg discloses the computer system of claim 2 
wherein the data load command is a request for data in the main system memory. [As 
shown above, a data cache exists in the system. The load commands have been 
shown above to be requests for data operands. It is inherent that a data cache first 
requests data from a data cache because its whole purpose of existence is to be a fast 
location for data retrieval. It is also inherent that if the data cache misses (the data is 
not in the cache), the data will be requested from the main system memory. Further 
proof is given in the included IEEE dictionary term for "hit" (shown to be a synonym for 
cache hit). Here it shows that referencing data in cache eliminates the need to go to a 
secondary storage, thus meaning that on a miss, the secondary storage is referenced. 
Since the disclosure of Rotenberg only discloses two data storage locations, the data 
cache and main memory, both shown above, the secondary storage is main system 
memory and thus is referenced.] 

14. In regard to claim 7, Rotenberg discloses the computer system of claim 4 
wherein if the load value queue becomes full, execution of instructions in the leading 
thread is temporarily halted to prevent more data values from entering the load value 
queue; and wherein if the load value queue becomes empty, execution of instructions in 
the second thread is temporary halted to allow more data values to enter the load value 
queue. [Near the end of the summary section (section 4), it is disclosed that if the delay 
buffer (load value queue) is full, the R-stream (trailing thread) is given priority to access 
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the fetch/dispatch pipeline, meaning that the A-stream or leading thread is temporarily 
halted in favor of the trailing thread so that it can issue and dispatch instructions. It is 
also disclosed that if the delay buffer (load value queue) is not full (where empty is an 
instance of this), the A-stream (leading thread) is given priority retire a trace, meaning 
that the R-stream or trailing thread is temporarily halted in favor of the leading thread 
when it can retire a trace of instructions.] 

15. In regard to claim 15, Rotenberg discloses a method of replicating data values in 
an SRT processor which can fetch and execute a program set in two separate threads 
so that each thread includes substantially the same instructions as the other thread, one 
of said threads being a leading thread and the other of said threads being a trailing 
thread; [In the second column of section 1 .1 , the system is shown to run two explicit 
copies of the program run concurrently on the same processor resources. Further down 
it is shown that this is possible using simultaneous multithreading. Thus two copies of 
the same program (the term redundant is used on the top of the next page) are run in 
these simultaneous threads. Section 1 .2 shows that the disclosed system comprises 
multiple threads (streams) that run the same program at the same time with one thread 
having a slight lag and thus a set of instructions is run in a leading and trailing thread.] 
the method comprising: 

a. implementing a counter to cause instructions in the leading thread to 
execute ahead of corresponding instructions in the trailing thread; [The delay 
buffer of figure 2 is described in section 2.1 .4. Here it is shown that the trailing 
thread (R-stream) lags behind the leading thread (A-stream), which is running 
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ahead. The section also shows that this delay or lag is controlled by the length of 
{ the delay buffer and thus the delay buffer is functionally equivalent to a counter 
and it keeps track or counts the number of instructions or cycles the threads are 
separated by as instructions from the leading thread pass through it.] 

b. accessing a data source to load a data value when the leading thread 
requests said data; [It is inherent that when the leading thread requests data that 
it requests it from a data source of some sort.] 

c. storing the data value in a load value queue; accessing the load value 
queue for the data value for corresponding data requests in the trailing thread. 
[The paragraph immediately under figure 3 shows that a delay buffer contains 
data and control information from the first run of the program (on the leading 
thread or A-stream). This is a clear differentiation of the result data and other 
control information such as that for predictions. In the paragraph directly above 
figure 3, it is shown that source operands (data) of instructions are predicted. In 
the paragraph directly below the figure again, it is stated that the delay buffer 
provides the R-stream (trailing thread) with perfect data flow prediction. 
Therefore, perfect data operand prediction must be provided, and thus the data 
operands themselves provided (replicated as the control information) in this delay 
buffer for the trailing thread. The first paragraph of section 1 .2 shows that this 
buffer is in fact a queue and thus another appropriate name for the structure is a 
load value queue.] 
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16. In regard to claim 16, Rotenberg discloses the method of claim 15 further 
comprising: executing the data requests in the trailing thread in program order. [Though 
section 1.3.1 details that instructions in the trailing thread (R-stream) may be executed 
highly in parallel due to perfect prediction, the instructions are not executed out-of-order, 
as can be seen in figure 3. Therefore the trailing thread executes instructions including 
the data load commands in program order.] 

17. In regard to claim 17, Rotenberg discloses the method of claim 16 further 
comprising: storing the data values in the load value queue after the data requests in 
the leading thread execute. [Figure 2 and the first paragraph of section 1 .2 shows that 
the data stored in the load value queue (delay buffer) is pushed there when the results 
of the A-stream (leading thread) are committed (and thus have executed).] 

18. In regard to claim 19, Rotenberg discloses the method of claim 18 wherein the 
data source is a data cache. [As shown in figures 4 and 5, a data cache exists in the 
system. The load commands have been shown above to be requests for data 
operands. It is inherent that a data cache first requests data from a data cache because 
its whole purpose of existence is to be a fast location for data retrieval. Further proof is 
given in the included IEEE dictionary term for "hit" (shown to be a synonym for cache 
hit). Here it shows that referencing data in cache eliminates the need to go to a 
secondary storage.] 

19. In regard to claim 20, Rotenberg discloses the method of claim 18 wherein the 
data source is system, memory source. [As shown above, a data cache exists in the 
system. The load commands have been shown above to be requests for data 
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operands. It is inherent that a data cache first requests data from a data cache because 
its whole purpose of existence is to be a fast location for data retrieval. It is also 
inherent that if the data cache misses (the data is not in the cache), the data will be 
requested from the main system memory. Further proof is given in the included IEEE 
dictionary term for "hit" (shown to be a synonym for cache hit). Here it shows that 
referencing data in cache eliminates the need to go to a secondary storage, thus 
meaning that on a miss, the secondary storage is referenced. Since the disclosure of 
Rotenberg only discloses two data storage locations, the data cache and main memory, 
both shown above, the secondary storage is main system memory and thus is 
referenced.] 

20. In regard to claim 21 , Rotenberg discloses the method of claim 17 further 
comprising: transmitting data to and from the load value queue using an error correction 
technique. [The last two paragraphs of section 1 .2 shows that the entire delay buffer 
(load value queue) concept is for checkpoint recovery (an error correction technique). 
Therefore, loads to and from the buffer use an error correction technique.] 

Claim Rejections - 35 USC § 103 

21 . The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

22. Claims 8-12 and 14 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Rotenberg in view of Tullsen (Simultaneous Multithreading). 
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23. In regard to claim 8, 

a. Rotenberg discloses 

i. a pipelined, simultaneous and redundantly threaded ("SRT") 
processor, [In the second column of section 1.1, the system is shown to 
run two explicit copies of the program run concurrently on the same 
processor resources. Further down it is shown that this is possible using 
simultaneous multithreading. Thus two copies of the same program (the 
term redundant is used on the top of the next page) are run in these 
simultaneous threads. Figure 5 shows that the system is broken up into 
pipeline stages and is thus pipelined. This figure, as well as figure 4, 
shows that the system includes a data cache or d-cache.] 

ii. comprising a program counter configured to assign program count 
identifiers to instructions in each thread that are retrieved by the 
processor; [Section 2.1 .3 shows that a program counter is required for 
each thread, which inherently assign a program count identifier to 
instructions in the thread.] 

iii. a counter that causes instructions in a copy of program thread to 
begin execution after corresponding instructions in another copy of the 
program thread begin execution; [Section 1 .2 shows that the disclosed 
system comprises multiple threads (streams) that run the same program 
at the same time with one thread having a slight lag and thus a set of 
instructions is run in a leading and trailing thread. The delay buffer of 
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figure 2 is described in section 2.1 .4. Here it is shown that the trailing 
thread (R-stream) lags behind the leading thread (A-stream), which is 
running ahead. The section also shows that this delay or lag is controlled 
by the length of the delay buffer and thus the delay buffer is functionally 
equivalent to a counter and it keeps track or counts the number of 
instructions or cycles the threads are separated by as instructions from the 
leading thread pass through it.] 

iv. wherein said processor is configured to detect transient faults 
during program execution by executing instructions the copies of the 
program thread and wherein false errors caused by incorrectly replicating 
fetched data in the redundant program threads are avoided by replicating 
the actual data retrieved from data fetch instructions in a first program 
thread for a second program thread. [Section 1 .2 shows that the 
disclosed system comprises multiple threads (streams) that run the same 
program at the same time with one thread having a slight lag and thus a 
set of instructions is run in a leading and trailing thread. The paragraph 
immediately under figure 3 shows that a delay buffer contains data and 
control information from the first run of the program (on the leading thread 
or A-stream). This is a clear differentiation of the result data and other 
control information such as that for predictions. In the paragraph directly 
above figure 3, it is shown that source operands (data) of instructions are 
predicted. In the paragraph directly below the figure again, it is stated that 
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the delay buffer provides the R-stream (trailing thread) with perfect data 
flow prediction. Therefore, perfect data operand prediction must be 
provided, and thus the data operands themselves provided (replicated as 
the control information) in this delay buffer for the trailing thread.] 

b. Rotenberg does not disclose 

i. floating point execution units configured to execute floating point 
instructions; 

ii. integer execution units configured to execute integer-based 
instructions; 

iii. load/store units configured to perform fetch and store operations to 
or from data sources such as a data cache, memory, and data registers; 

[However, Rotenberg does disclose multiple execution units (processing 
elements) as can be seen in figures 4 and 5.] 

c. Tullen has taught a simultaneous multithreading system comprising 

i. floating point execution units configured to execute floating point 
instructions; 

ii. integer execution units configured to execute integer-based 
instructions; 

iii. load/store units configured to perform fetch and store operations to 
or from data sources such as a data cache, memory, and data registers; 
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[The last paragraph of page 393 shows that floating point, integer, and load/store 
units are included in the system for executing floating point, integer, and 
load/store instructions as given in table 1 .] 

d. Rotenberg has shown in section 2.1 that most of his design is derived 
from simultaneous multithreaded machines with the Tullen reference in particular 
(as seen in the bibliography to be reference 10) for techniques that are well 
established and understood and are seamlessly incorporated. This reference to 
Tullen for details of the design not disclosed by Rotenberg would have motivated 
one of ordinary skill in the art to modify the design of Rotenberg to include the 
details regarding the functional units disclosed by Tullen. 
It would have been obvious to one of ordinary skill in the art to modify the design of 
Rotenberg to include the various functional units taught by Tullen because of the 
reference to this disclosure by Rotenberg concerning the details of his design. 
24. In regard to claim 9, Rotenberg in view of Tullen discloses the SRT processor of 
claim 8 wherein the processor further comprises: 

a. a load value queue for storing the data values fetched in response to data 
fetch instructions in the first program thread; [As shown above, the loaded 
operand data is stored in a delay buffer. The first paragraph of section 1 .2 of 
Rotenberg shows that this buffer is in fact a queue and thus another appropriate 
name for the structure is a load value queue.] 

b. wherein the load/store units place a duplicate copy of the data in the load 
value queue after fetching the data from the data source and wherein the 
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load/store units access the load value queue and not the data source to fetch 
data values in response to data fetch instructions in the second program thread. 
[Since all instructions store their data operands in the load value queue for 
correct operand prediction, load and store instructions also store data received in 
this queue and thus the load/store unit places the duplicate data in the queue. 
As shown above, the trailing thread fetches all data values from the load value 
queue (delay buffer) instead of the original data source. Therefore, the load and 
store instructions will look to this queue for data and thus the load/store unit 
accesses the queue.] 

25. In regard to claim 10, Rotenberg in view of Tullen discloses the SRT processor of 
claim 9 further comprising a register update unit; wherein the register update unit is 
configured to hold instructions in a queue until the instructions are executed and retired 
by the SRT processor and wherein the data fetched by the load/store units in response 
to data fetch instructions in the first program thread is not placed in the load value 
queue until the data fetch instructions in the first program thread retire from the register 
update unit. [Section 2.1.1 of Rotenberg introduces a register-renaming scheme where 
multiple physical registers are mapped to the logical registers, by a register map, for the 
purpose of resolving data dependencies. Since the physical registers hold data (and 
thus are queues) to update the logical registers once all dependencies have been 
resolved, this structure is a register update unit. Figure 2 and the first paragraph of 
section 1 .2 of Rotenberg show that the data stored in the load value queue (delay 
buffer) is pushed there when the results of the A-stream (leading thread) are committed 
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or retired. Thus the data values must be stored in the register update unit until the 
instructions are retired and updates commence to both the registers and the load value 
queue.] 

26. In regard to claim 1 1 , 

a. Rotenberg in view of Tullen discloses the SRT processor of claim 9 
wherein data fetch instructions are executed in the same order in both the first 
and second program threads. [Though section 1 .3.1 details that instructions in 
the trailing thread (R-stream) may be executed highly in parallel due to perfect 
prediction, the instructions are not executed out-of-order, as can be seen in 
figure 3. Therefore the trailing thread executes instructions including the data 
load commands in program order, as did the leading thread.] 

b. Rotenberg in view of Tullen as applied to claim 8 does not disclose 
wherein the SRT processor is an out-of-order processor capable of executing 
instructions in the most efficient order. 

c. Tullen does disclose a simultaneous multithreaded processor that is an 
out-of-order processor capable of executing instructions in the most efficient 
order. Column 1 , paragraph 2 on page 394 shows that out-of-order execution 
can be done and is supported and thus the processor is capable of out-of-order 
execution. 

d. Rotenberg has shown in section 2.1 that most of his design is derived 
from simultaneous multithreaded machines with the Tullen reference in particular 
(as seen in the bibliography to be reference 10) for techniques that are well 
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> established and understood and are seamlessly incorporated. This reference to 
Tullen for details of the design not disclosed by Rotenberg would have motivated 
one of ordinary skill in the art to modify the design of Rotenberg to include the 
capability of out-of-order execution disclosed by Tullen. 
It would have been obvious to one of ordinary skill in the art to modify the design of 
Rotenberg to include the capabilities of out-of-order execution by Tullen because of the 
reference to this disclosure by Rotenberg concerning the details of his design. 

27. In regard to claim 12, Rotenberg in view of Tullen discloses the SRT processor of 
claim 1 1 wherein the load value queue is a FIFO buffer and data is transmitted to and 
from the buffer using an error correction technique. [The first paragraph of section 1 .2 
show that the load value queue is a FIFO delay buffer. The next two paragraphs of 
section 1 .2 shows that the entire delay buffer (load value queue) concept is for 
checkpoint recovery (an error correction technique). Therefore, loads to and from the 
buffer use an error correction technique.] 

28. In regard to claim 14, Rotenberg in view of Tullen discloses the SRT processor of 
claim 12 wherein if the load value queue becomes full, the first thread is stalled to 
prevent more data values from entering the load value queue; and wherein if the load 
value queue becomes empty, the second thread is stalled to allow data values to enter 
the load value queue. [Near the end of the summary section (section 4), it is disclosed 
that if the delay buffer (load value queue) is full, the R-stream (trailing thread) is given 
priority to access the fetch/dispatch pipeline, meaning that the A-stream or leading 
thread is temporarily halted in favor of the trailing thread so that it can issue and 
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dispatch instructions. It is also disclosed that if the delay buffer (load value queue) is 
not full (where empty is an instance of this), the A-stream (leading thread) is given 
priority retire a trace, meaning that the R-stream or trailing thread is temporarily halted 
in favor of the leading thread when it can retire a trace of instructions.] 

29. Claim 13 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Rotenberg in view of Tullen as applied to claims 11-12 above, and further in view of 
Merchant (6,665,792). 

30. In regard to claim 13, 

a. Rotenberg in view of Tullen discloses the SRT processor of claim 1 2 
wherein the individual load value entries in the load value queue comprise: the 
data source from which the data was fetched; [Since, as shown above, the 
trailing thread executes the same instructions as the leading thread.] 

i. an address indicating the physical location in the data source from 
which the data was fetched; an address indicating the physical location in 
the trailing thread inherently locates the correct data with the same 
address. [Thus the address must be stored in the buffer.] 

ii. and the data value that was retrieved by the load/store units when a 
data fetch instruction was executed (as shown above in regards to the 
independent claim). 

b. Rotenberg in view of Tullen does not disclose that the size of the data 
value is stored in the buffer; 
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c. Merchant has taught in column 6, lines 10-13 show that a load buffer 
stores the size of the data. 

d. The system of Rotenberg in view of Tullen, as shown above, stores 
operand data in the buffers. In order to handle the varying sizes of operand data 
for execution, a field indicating the size of the operand would speed up 
recognition on how to handle the data. This ability to quickly recognize operand 
size would have motivate one of ordinary skill in the art to modify the design of 
Rotenberg in view of Tullen to include the data size in the load buffer as taught 
by Merchant. 

It would have been obvious to one of ordinary skill in the art at the time of invention to 
modify the design of Rotenberg in view of Tullen to store the data size in the load buffer 
so that operand sizes are quickly recognized and the operands handled correctly. 

31. Claim 18 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Rotenberg in view of Merchant (6,665,792). 

32. , In regard to claim 18, 

a. Rotenberg discloses the method of claim 1 7 further comprising: 

i. storing the data value in a FIFO buffer; [The first paragraph of 
section 1 .2 show that the load value queue is a FIFO delay buffer.] 

ii. and storing the following information with each entry in the buffer: 

(1 ) an address indicating the physical location in the data source 
from which the data was fetched; [Since, as shown above, the 
trailing thread executes the same instructions as the leading thread, 
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the trailing thread inherently locates the correct data with the same 

address. Thus the address must be stored in the buffer.] 

(2) and the data value that was retrieved by the data request in 

the leading thread (as shown above in regards to the independent 

claim). 

b. Rotenberg does not disclose that the size of the data value is stored in the 
buffer; 

c. Merchant has taught in column 6, lines 10-13 show that a load buffer 
stores the size of the data. 

d. The system of Rotenberg, as shown above, stores operand data in the 
buffers. In order to handle the varying sizes of operand data for execution, a field 
indicating the size of the operand would speed up recognition on how to handle 
the data. This ability to quickly recognize operand size would have motivate one 
of ordinary skill in the art to modify the design of Rotenberg to include the data 
size in the load buffer as taught by Merchant. 

It would have been obvious to one of ordinary skill in the art at the time of invention to 
modify the design of Rotenberg to store the data size in the load buffer so that operand 
sizes are quickly recognized and the operands handled correctly. 

Response to Arguments 
33. Applicant's arguments filed 03 June 2004 have been fully considered but they are 
not persuasive. 
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34. Applicant has argued in pages 9-10 that Rotenberg does not teach or suggest 
the use of a slack counter that causes instructions in the trailing thread to lag behind 
corresponding instructions in the leading thread. Applicant has further stated that at 
most Rotenberg only teaches a FIFO buffer. This FIFO buffer would be the delay buffer 
of figure 2. The delay buffer of figure 2 is described in section 2.1.4. Here it is shown 
that the trailing thread (R-stream) lags behind the leading thread (A-stream), which is 
running ahead. The section also shows that this delay or lag is controlled by the length 
of the delay buffer and thus the delay buffer is functionally equivalent to a counter and it 
keeps track or counts the number of instructions or cycles the threads are separated by 
as instructions from the leading thread pass through it. Therefore, Rotenberg in fact 
does teach the use of a slack counter that causes instructions in the trailing thread to 
lag behind corresponding instructions in the leading thread. 

Conclusion 

35. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
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the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the mailing date of this final action. 

36. The following is text cited from 37 CFR 1.111 (c): In amending in reply to a 
rejection of claims in an application or patent under reexamination, the applicant or 
patent owner must clearly point out the patentable novelty which he or she thinks the 
claims present in view of the state of the art disclosed by the references cited or the 
objections made. The applicant or patent owner must also show how the amendments 
avoid such references or objections. 

37. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. The references cited in the previous action remain pertinent and 
remain cited herein. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Shane F Gerstl whose telephone number is (571) 272- 
4166 after October 12 and (703) 305-7305 before October 12. The examiner can 
normally be reached on M-F 6:45-4:15 (First Friday Off). 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Eddie Chan can be reached on (571) 272-4162 after October 12 and (703) 
305-9712 before October 12. The fax phone number for the organization where this 
application or proceeding is assigned is 703-872-9306. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-21 7-91 97 (toll-free). 



Shane F Gerstl 
Examiner 
Art Unit 2183 
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August 25, 2004 




